a 



6AATTC6GA6 GAATTATTCA AAACATAAAC ACAATAAACA ATTTGAGTA6 TTCCCGCACA 60 
CACACACACA CACAGCCCGT 66ATTATTAC ACTAAAAGC6 ACACTCAATC CAAAAAATCA 120 



GCAACAAAAA CATCAATAAA C ATG CAT TGG ATT AAA TGT TTA TTA ACA GCA 

Met His Trp lie Lys Cys Leu Leu Thr Ala 
1 5 10 



17! 



TTC An TGC TTC ACA GTC ATC CTG CAG GTT CAC ACT TCC GGC AGC nT 219 
Phe He Cys Phe Thr Vat lie Vol Gin Vol His Ser Ser Gty Ser Phe 
15 20 25 

GAG TTG CGC CTG AAG TAC TTC AGC AAC GAT CAC GGG CGG GAC AAC GAG 267 
Glu Leu Arg Leu Lys Tyr Phe Ser Asn Asp His Gly Arg Asp Asn Glu 
30 35 40 

G6T CGC TGC TGC AGC GGG GAG TCG GAC GGA GCG ACG GGC AAG TGC CTG 315 
Gly Arg Cys Cys Ser Gly Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu 
45 50 55 

GGC AGC TGC AAG ACG CGG TTT CGC GTC TGC CTA AAG CAC TAC CAG GCC 363 
Gly Ser Cys Lys Thr Arg Phe Arg Val Cys Leu Lys His Tyr Gin Ala 
60 65 70 

ACC ATC GAC ACC ACC TCC CAG TGC ACC TAC GGG GAC GTG ATC ACG CCC 411 
Thr lie Asp Thr Thr Ser Gin Cys Thr Tyr Gly Asp Vol He Thr Pro 
75 BO 85 90 

ATT CTC GGC GAG AAC TCG GTC AAT CTG ACC GAC GCC CAG CGC TTC CAG 459 
lie Leu Gly Glu Asn Ser Val Asn Leu Thr Asp Ala Gin Arg Phe Gin 
95 100 105 

AAC AAG GGC TTC ACG AAT CCC ATC CAG TTC CCC TTC TCG TTC TCA TGG 507 
Asn Lys Gly Phe Thr Asn Pro He Gin Phe Pro Phe Ser Phe Ser Trp 
110 115 120 
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CCG GET ACC TTC TCG CTG ATC GTC GAG GCC TGG CAT GAT ACG AAC AAT 
Pro Gly Thr Phe Ser Leu He Val Glu Ala Trp His Asp Thr Asn Asn 
125 130 135 

AGC GGC AAT GCG CGA ACC AAC AAG CTC CTC ATC CAG CGA CTC TTG GTG 603 
Ser Gly Asn Ala Arg Thr Asn Lys Leu Leu He Gin Arg Leu Leu Val 
140 " 145 150 

CAG CAG GTA CTG GAG GTG TCC TCC GAA TGG AAG ACG AAC AAG TCG GAA 651 
Gin Gin Val Leu Glu Val Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu 
155 160 165 170 

TCG CAG TAC ACG TCG CTG GAG TAC GAT TTC CGT GTC ACC TGC GAT CTC 699 
Ser Gin Tyr Thr Ser Leu Glu Tyr Asp Phe Arg Val Thr Cys Asp Leu 
175 180 185 

AAC TAC TAC GGA TCC GGC TGT GCC AAG TTC TGC CGG CCC CGC GAC GAT 747 
Asn Tyr Tyr Gly Ser Gly Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp 
190 195 200 

TCA TTT GGA CAC TCG ACT TGC TCG GAG ACG GGC GAA ATT ATC TGT TTG 795 
Ser Phe Gly His Ser Thr Cys Ser Glu Thr Gly Glu He lie Cys Leu 
205 210 215 

ACC GGA TGG CAG GGC GAT TAC TGT CAC ATA CCC AAA TGC GCC AAA GGC 843 
Thr Gly Trp Gin Gly Asp Tyr Cys His lie Pro Lys Cys Ala Lys Gly 
220 225 230 

TGT GAA CAT GGA CAT TGC GAC AAA CCC AAT CAA TGC GTT TGC CAA CTG 891 
Cys Glu His Gly His Cys Asp Lys Pro Asn Gin Cys Val Cys 6ln Leu 
235 240 245 250 

GGC TGG AAG GGA GCC TTG TGC AAC GAG TGC GTT CTG GAA CCG AAC TGC 939 
Gly Trp Lys Gly Ala Leu Cys Asn Glu Cys Val Leu Glu Pro Asn Cys 
255 260 265 
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ATC CAT GGC ACC TGC AAC AAA CCC TG6 ACT T6C ATC TGC AAC GAG GGT 
He His Gly Thr Cys Asn Lys Pro Trp Thr Cys He Cys Asn Glu Gly 
270 275 280 

TGG GGA GGC TTG TAC TGC AAC CAG GAT CTG AAC FAC TGC ACC AAC CAC 
Trp Gly Gly Leu Tyr Cys Asn Gin Asp Leu Asn Tyr Cys Thr Asn His 
285 290 295 

AGA CCC TGC AAG AAT GGC GGA ACC TGC TTC AAC ACC GGC GAG GGA TTG 
Arg Pro Cys Lys Asn Gly Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu 
300 305 310 

TAC ACA TGC AAA TGC GCT CCA GGA TAC AGT GGT GAT GAT TGC GAA AAT 
Tyr Thr Cys Lys Cys Ala Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn 
315 320 325 330 

GAG ATC TAC TCC TGC GAT GCC GAT GTC AAT CCC TGC CAG AAT GGT GGT 
Glu He Tyr Ser Cys Asp Ala Asp Val Asn Pro Cys Gin Asn Gly Gly 
335 340 345 

ACC TGC ATC GAT GAG CCG CAC ACA AAA ACC GGC TAC AA6 TGT CAT TGC 
Thr Cys I le Asp Glu Pro His Tlr Lys Thr Gly Tyr Lys Cys His Cys 
350 355 360 

GCC AAC GGC TGG AGC GGA AAG ATG TGC GAG GAG AAA GTG CTC ACG TGT 
Ala Asn Gly Trp Ser Gly Lys Met Cys Glu Glu Lys Vd Leu Thr Cys 
365 370 375 

TCG GAC AAA CCC TGT CAT CAG GGA ATC TGC CGC AAC GTT CGT CCT GGC 
Ser Asp Lys Pro Cys His Gin Gly He Cys Arg Asn Val Arg Pro Gly 
380 385 .390 

TTG GGA AGC AAG GGT CAG GGC TAC CAG TGC GAA TGT CCC ATT GGC TAC 
Leu Gly Ser Lys Gly Gin Gly Tyr Gin Cys Glu Cys Pro lie Gly Tyr 
395 400 4 05 410 
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AGC GGA CCC AAC T6C GAT CTC CAG CTG GAC AAC TGC AGT CCG AAT CCA 
Ser Gly Pro Asn Cys Asp Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro 
415 420 425 
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TGC ATA AAC GGT GGA AGC TGT CAG CCG AGC GGA AAG TGT ATT TGC CCA 1467 
Cys He Asn Gly Gly Ser Cys Gin Pro Ser Gly Lys Cys He Cys Pro 
43 0 435 440 

GCG GGA TTT TCG GGA ACG AGA TGC GAG ACC AAC ATT GAC GAT TGT CTT 1515 
Ala Gly Phe Ser Gly Thr Arg Cys Glu Thr Asn He Asp Asp Cys Leu 
445 450 455 

GGC CAC CAG TGC GAG AAC GGA GGC ACC TGC ATA GAT ATG GTC AAC CAA 1563 
Gly His Gin Cys Glu Asn Gly Gly Thr Cys He Asp Met Val Asn Gin 
460 465 470 

TAT CGC TGC CAA TGC GTT CCC GGT TTC CAT GGC ACC CAC TGT AGT AGC 1611 
Tyr Aro Cys Gin Cys Vol Pro Gly Phe His Gly Thr His Cys Ser Ser 
475 480 485 490 

AAA GTT GAC TTG TGC CTC ATC AGA CCG TGT GCC AAT GGA GGA ACC TGC 1659 
Lys Vol Asp Leu Cys Leu He Arg Pro Cys Ala Asn Gly Gly Thr Cys 
495 500 . 505 

TTG AAT CTC AAC AAC GAT TAC CAG TGC ACC TGT CGT GCG GGA TTT ACT 1707 
Leu Asn Leu Asn Asn Asp Tyr Gin Cys Thr Cys Arg Ala Gly Phe Thr 
510 515 520 

GGC AAG GAT TGC TCT GTG GAC ATC GAT GAG TGC AGC AGT GGA CCC TGT 1755 
Gly Lys Asp Cys ier Yal Asp lie Asp Glu Cys Ser Ser Gly Pro Cys 
525 530 535 

CAT AAC GGC GGC ACT TGC ATG AAC CGC GTC AAT TCG TTC GAA TGC GTG 1803 
His Asn Gly Gly Thr Cys Het Asn Arg Vol Asn Ser Phe Glu Cys Vol 
540 545 550 
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GCC AAT GGT HC AGG GGC AAG CAG TGC GAT GAG GAG TCC TAC GAT 
Ala Asn Gly Phe Arg Gly Lys Gin Cys Asp Glu Glu Ser Tyr Asp 
560 565 570 



TCC GTG ACC TTC GAT GCC CAC CAA TAT GGA GCG ADC ACA CAA GCG AGA 
Ser Val Thr Phe Asp Ala His Gin Tyr Gly Ala Thr Thr Gin Ala Arg 
575 580 585 

GCC GAT GGT TTC ACC AAT GCC CAG GTA GTC CTA An GCT GTT TTC TCC 
Ala Asp Gly Leu Thr Asn Ala Gin Val Val Leu He Ala Val Phe Ser 
590 595 600 

GTT GCG ATG CCT TTG GTG GCG GTT ATT GCG GCG TGC GTG GTC TTC TGC 
Val Ala Met Pro Leu Val Ala Val He Ala Ala Cys Val Val Phe Cys 
605 610 615 

ATG AAG CGC AAG CGT AAG CGT GCT CAG GAA AAG GAC GAC GCG GAG GCC 
Met Lys Arg Lys Arg Lys Arg Ala Gin Glu Lys Asp Asp Ala Glu Ala 
620 625 630 

AGG AAG CAG AAC GAA CAG AAT GCG GTG GCC ACA ATG CAT CAC AAT GGC 
Aro Lys Gin Asn Glu Gin Asn Ala Val Ala Thr M His His Asn Gly 
635 640 645 650 

AGT GGG GTG GGT GTA GCT TTG GCT TCA GCC TCT CTG GGC GGC AAA ACT 
Ser Gly Val Gly Val Ala Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr 
655 660 665 

GGC AGC AAC AGC GGT CTC ACC TTC GAT GGC GGC AAC CCG AAT ATC ATC 
Gly Ser Asn Ser Gly Leu Thr Phe Asp Gly Gly Asn Pro Asn lie He 
670 675 680 

AAA AAC ACC TGG GAC AAG TCG GTC AAC AAC ATT TOT GCC TCA GCA GCA- 
Lvs Asn Thr Trp Asp Lys Ser Yal Asn Asn He Cys Ala Ser Ala Ala 
685 690 695 
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GCA GCG GCG GCG GCG GCA GCA GCG GCG GAC GAG TGT CTC ATG TAC GGC 2283 
Ala Ala Ala Ala Ala Ala Ala Ala Ala Asp Glu Cys Leu Het Tyr Gly 
700 705 710 

GGA TAT GTG GCC TCG GTG GCG GAT AAC AAC AAT GCC AAC TCA GAC TTT 2331 
Glv Tvr Val Ala Ser Vat Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe 
715 720 725 730 

TGT GTG GCT CCG CTA CAA AGA GC€ AAG TCG CAA AAG CAA CTC AAC ACC 2379 
Cvs Val Ala Pro Leu Gin Arg Ala Lys Ser Gin Lys Gin Leu Asn Thr 
735 740 745 

GAT CCD ACG CTC ATG CAC CGC GGT TCG CCG GCA GGC AGC TCA GCC AAG 2427 
Asp Pro Thr Leu Met His Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys 
750 755 760 

GGA GCG TCT GGC GGA GGA CCG GGA GCG GCG GAG GGC AAG AGG ATC TCT 2475 
Gly Ala Ser Gly Gly Gly Pro Gly Ala Ala Glu Gly Lys Arg lie Ser 
765 770 775 

GTT TTA GGC GAG GGT TCC TAC TGT AGC CAG CGT TGG CCC TCG TTG GCG 2523 
Val Leu Gly Glu Gly Ser Tyr Cys Ser Gin Arg Trp Pro Ser Leu Ala 
780 ' 785 790 

GCG GCG GGA GTG GCC GGA GCC TGT TCA TCC CAG CTA ATG GCT GCA GCT 2571 
Ala Ala Gly Val Ala Gly Ala Cys Ser Ser Gin Leu Met Ala Ala Ala 
795 800 B05 810 

TCG GCA GCG GGC AGC GGA GCG GGG ACG GCG CAA CAG CAG CGA TCC GTG 2619 
Ser Ala Ala Gly Ser Gly Ala Gly Thr Ala Gin Gin Gin Aro, Ser Val 
815 820 825 

GTC TGC GGC ACT CCG CAT ATG TAACTCCAAA AATCCGGAAG GGCTCCTGGT 2670 
Val Cys Gly Thr Pro His Met 
830 

AAATCCGGAG AAATCCGCAT GGAGGAGCTG ACAGCACATA CACAAAGAAA AGACTGGGTT 2730 

GGGTTCAAAA TGTGAGAGAG ACGCCAAAAT GTTGTTGTTG ATTGAAGCAG TTTAGTCGTC 2790 

ACGAAAAATG AAAAATCTGT AACAGGCATA ACTCGTAAAC TCCCTAAAAA ATTTGTATAG- 2850 

TAATTAGCAA AGCTGTGACC CAGCCGTTTC GATCCCGAAT TC 2892 
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10 20 30 40 50 60 <%> ^ Q 

GMTTCCCCT CCCCCCTTTT TCCATGCAGC T6ATCTAAAA GGGAATAAAA GGCTGCGCAT ^ cP \J 

70 80 90 100 110 120 

AATCATAATA ATAAAAGAAG GGGAGCGCGA GAGAAGGAAA GAAAGCCGGG AGGTGGAAGA 
130 140 150 160 170 180 

GGAGGGGGAG CGTCTCAAAG AAGCGATCAG AATAATAAAA GGAGGCCGGG CTCTTTGCCT 
190 200 210 220 230 240 

TCTGGAAGGG GCCGCTCTTG AAAGGGCTTT TGAAAAGTGG TGTTGTTTTC CAGTCGTGCA 
250 260 270 280 290 300 

TGCTCCAATC GGCGGAGTAT ATTAGAGCCG GGACGCGGCC GCAGGGGCAG CGGCGACGGC 
310 320 330 340 350 360 

AGCACCGGCG GCAGCACCAG CGCGAACAGC AGCGGCGGCG TCCCGAGTGC CCGCGGCGGC 
370 380 390 400 410 420 

GCGCGCAGCG ATGCGTTCCC CACGGACACG CGGCCGGTCC GGGCGCCCCC TAAGCCTCCT 
MRS PRTR GRS GRP LSLL> 
430 440 450 460 470 480 

GCTCGCCCTG CTCTGTGCCC TGCGAGCCAA GGTGTGTGGG GCCTCGGGTC AGTTCGAGTT 
L A L L C A L R A K V C G A S G Q F E L> 
490 500 510 520 530 540 

GGAGATCCTG TCCATGCAGA ACGTGAACGG GGAGCTGCAG AACGGGAACT GCTGCGGCGG 
c t L SMQ NVNG ELO NGN CCGG> 
550 560 570 580 590 600 

CGCCCGGAAC CCGGGAGACG GCAAGTGCAC CCGCGACGAG TGTGACACAT ACTTCAAAGT 
A R N P G 0 R K C T R D E C D T Y F K V> 
610 620 630 640 650 660 

GTGCCTCAAG GAGTATCAGT CCCGCGTCAC GGCCGGGGGG CCCTGCAGCT TCGGCTCAGG 
C L K E Y Q S R V T A G G P C S F G S G> 
670 680 690 700 710 720 

GTCCACGCCT GTCATCGGGG GCAACACCTT CAACCTCAAG GCCAGCCGCG GCAACGACCC 
STP VIG GNTF NLK ASP GNDP> 
730 740 750 760 770 780 

GAACCGCATC GTGCT6CCTT TCAGTTTCGC CTGGCCGAGG TCCTATACGT TGCTTGTGGA 
MRI VLP FSFA WPR SYT L t V E> 
790 800 810 820 830 840 

GGCGTGGGAT TCCAGTAATG ACACCGTTCA ACCTGACAGT ATTATTGAAA AGGCTTCTCA 
AWD SSN DTVQ POS 1 I E A S H> 
850 860 870 880 890 900 

CTCGGGCATG ATCAACCCCA GCCGGCAGTG GCAGACGCTG AAGCAGAACA CGGGCGTTGC 
S6M 1NP SRQW QTL KQN TGVA> 
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o^n 930 940 950 960 <$k * <<\ 

CCACTTT6AG WCAGAtIc GCGTGACCTG TGATGACTAC TACTATGGCT TT Gf TGTM 

" F ™ Y ° 980 990 1000 1010 102O 

TMGTTCT6C CGaCCAGAG ATGACTTCTT TGGACACTAT GCCTGTCACC AGAATGG^ €> 

clc^ 2 AJWOT ~C CGAATGTAAC AGAGCTAm GGGGACMK 

CT LJS a«atS cttgcamct cccaggtog ™a agtacggctc 

j^CCTG TACTcS ACACGCS ™ GCATGTGTM 

^ ^1210 ^ ^ 1220 1230 H U CV 12S0 G .1260 
;CTGG CAGTGCCTCT GTGAGACCM 
> W QCL CETN 
1270 1280 1290 

OGT GGGACTCATC AGCCGTGTCT 
f C G T H Q P C L 
1330 1340 1350 

\ATAT CAGTGTTCCT GCCCTGAGGG 
C Y Q C S C P E G 

GCACGCCTGC CTCTCTGATC CCTCTCaSa CAGAGGcS TGTAAGGAGA CCIOXTM 



CAGTGCCTCT GTGAGACCAA C«8» CAGCTCTGTG ACAAAGATCT 

E P „7n ° 1280 290 1300 1310 1320 

CAATTACTGT GGGACTCATC AGCCGTGTCT CAACGGGGGA ACTTGTAGCA ACACAGGCC 

" \L G T !340 ° " 350 1370 1380 

TGACAAATAT CAGTGTTCCT GCCCTGAGGG GTATTCAGGA CCCAACTGTG AAATTGCTG^ 

" \L " C ,L C ' 41 V S l«0 P 1430 H40 



:acgccigl LiLitiwiu ^.u.^rv. _ T s L g> 

H % 1 5 H60 ' ' 47 G 14l 0 C H90 1500 

CTTTGAGTGT GAGTGTTCCC CAGGCTGGAC CGGCCCCACA TGCTCTACAA A«TTGA™ 

F \L ' C ,L/ 1530 1540 1550 1560 
CTGTTCTCCT MTAACTGTT ««» CACCTGC« GACCTGGTTA ACGOATTTAA 



C M 1590 1590 1600 1610 1620 

GTGTGTGTGC CCCCCACAGT GGACTGGGAA AACGTGCCAG TTAGATGCAA ATGAATGTGA^ 

GGCCAAACCT TCTCtS CCAAATCCTG TAAGAATCTC ATTGCC^T ACTAcS 
AKP CV N AKSC KNL lAi 
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lfi qn 1700 1710 1720 1730 1740 



™? «ST ? Ti™ ™ TST ™ 

, 7 cn 1760 1770 1780 1790 

GTGTCAGAAT GACGCCKCT GTCGGGATTT GGTTAATGGT TATCGCTGTA TCTGTCCACC 

° ° N ° A ,M U 1840 1850 I860 

TGGCTATGCA GGCGATCACT GTGAGAGAGA CATCGATGAA TGTGCCAGCA ACCCCTGTH 

G \mn G ° 1860 C E W90 1900 1910 1920 

GMTGG^ CACTGtSgA ATGMATCAA CAGATTCCAG TGTCTGTGTC CCACTGGm 

" G 19 G 30 1940 " 95 F 19^0 ' 1970 1960 

™? CTCTGT^C tggacatcga ttattgtgag cctaatccct gccagaacgg^ 

S G ,M0 L C 2 L L 2010 2020 2030 m l 

tgcccaS tacaaS ccagtgacta tttctgcaag tcccccgagg actatg«g 

A V50 ¥ " 2060 ' S °207 C 2080 C 2090 2100 

cmgaactIc tcacaS mm** <xwo«c ccctgtgaag tgattgam 

K "mo S " H20 * D 2!30 2140 2!50 2160 

CTGCACAGTG GCCATGGCTT OAACGAOC ACCTGAAGG^ GTGCGGTATA TnCCTCCAA 

C T ,^n ' " 2180 5 " 2190 2200 " 22!0 2 22 0 

CGTCtS CCTCACGGGA AGTGCAAGAG TO^TCGGGA GGCAAATTCA CCTCTGACTG 

V C 2 /3, f " 2240 2250 ° »> ' 3270 2 2 80 
TAACAA^ TTCACGGGAA CATACTGCCA TGAAAATATT AATG^TGTG AOAGCAACCC 

" ^2290 2300 T 23lS 2330 2 340 

ttgtaS ggtggcactt gcatcgatgg tgtcaactcc taca^tgca TCTGTAGTGA 

C R 2350 G G 2360 C ' 2370 " » 2390 H00 

cggctS ggggcctS gtgaaaccaa tattaatgac ™* ACCCCTGCCA 

G \L G ' 2!» ° ' 243^ 2440 2450 2460 

cmtgS ACGTGTCGCG acctggtcaa TGACTTCTAC TGTGACTGTA AWATWre 
NGG TCR DLVM DFY CDC & n b « 
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2470 2480 2490 2500 2510 2520 ^ 

GAAAGGAAAG ACCTGCCACT CACGTGACAG TCAGTGTGAT GAGGCCACGT GCAACAACGG 
K G K TCH SRDS QCD EAT CNNG> & 
2530 2540 2550 2560 2570 2580 

TGGCACCTGC TATGATGAGG GGGATGCTTT TAAGTGCATG TGTCCTGGCG GCTGGGAAGG 
GTC YDE GDAF KCM CPG GWEG> 
2590 2600 2610 2620 2630 2640 

AACAACCTGT AACATAGCCC GAAACAGTAG CTGCCTGCCC AACCCCTGCC ATAATGGGGG 
TTC N I A RNSS CLP NPC HNGG> 
2650 2660 2670 2680 2690 2700 

CACATGTGTG GTCAACGGCG AGTCCTTTAC GTGCGTCTGC AAGGAAGGCT GGGAGGGGCC 
TCV VNG ESFT CVC KEG WEGP> 
2710 2720 2730 2740 2750 2760 

CATCTGTGCT CAGAATACCA ATGACTGCAG CCCTCATCCC TGTTACAACA GCGGCACCTG 
ICA QNT NDCS PHP CYN SGTO 
2770 2780 2790 2800 2810 2820 

TGTGGATGGA GACAACTGGT ACCGGTGCGA ATGTGCCCCG GGTTTTGCTG GGCCCGACTG 
VDG DNW YRCE CAP GFA GPDC> 
2830 2840 2850 2860 2870 2880 

CAGAATAAAC ATCAATGAAT GCCAGTCTTC ACCTTGTGCC TTTGGAGCGA CCTGTGTGGA 
RIN INE CQSS PCA FGA TCVD> 
2890 2900 2910 2920 2930 2940 

TGAGATCAAT GGCTACCGGT GTGTCTGCCC TCCAGGGCAC AGTGGTGCCA AGTGCCAGGA 
EIN GYR CVCP PGH SGA KCQE> 
2950 2960 2970 2980 2990 3000 

AGTTTCAGGG AGACCTTGCA TCACCATGGG GAGTGTGATA CCAGATGGGG CCAAATGGGA 
VSG RPC I T M G SVI PDG A K W D> 
3010 3020 3030 3040 3050 3060 

TGATGACTGT AATACCTGCC AGTGCCTGAA TGGACGGATC GCCTGCTCAA AGGTCTGGTG 
DDC NTC QCLN GRI ACS KVWC> 
3070 3080 3090 3100 3110 3120 

TGGCCCTCGA CCHGCCTGC TCCACAAAGG GCACAGCGAG TGCCCCAGCG GGCAGAGCTG 
GPR PCL LHKG HSE CPS GQSC> 
3130 3140 3150 3160 3170 3180 
CATCCCCATC CTGGACGACC AGTGCTTCGT CCACCCCTGC ACTGGTGTGG GCGAGTGTCG 
IPI LDD QCFV HPC TGV GECR> 
3190 3200 3210 3220 3230 3240 
GTCTTCCAGT CTCCAGCCGG TGAAGACAAA GTGCACCTCT GACTCCTATT ACCAGGATAA 
SSS IQP V K T K CTS D SY Y Q D N> 
3250 3260 3270 3280 3290 3300 

CTGTGCGAAC ATCACATTTA CCTTTAACAA GGAGAT6ATG TCACCAGGTC TTACTACGGA 
CAN ITF TFNK EMM SPG LTTE> 
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•win 3320 3330 3340 3350 3360 ^ 
GCACATTTGC AGTGAATTGA GGAATTTGAA TATT7TGAAG AATGTTTCCG CTGAATATTC ^ 
it t p CPI RNLN ILK NVb alto 
« 3380 3390 3400 3410 3420 

AATCTA^TC GCTTGCGAGC CTTCCCCTTC AGCGAACAAT GAMTACATC TGGCCATTTC 
tvt ft r F PSPS ANN tin y * Jl 
14M 3440 3450 3460 3470 3480 
TGCT6AAGAT ATACGGGATG ATGGGAACCC GATCAAGGAA ATCACTGACA AMTMTCGA 

A E M ° W 1 R 3°500 D G 35!0 1 » ' »S 
TCTTGTTACT AAACGTGATG GAAACAGCTC GCTGATTGCT GCCGTTGAAG MGTAAGAGT 

ii/T It R n GNSS LIA AVE EVKV» 
1550 3560 3570 3580 3590 3600 
TCAGAGGCGG CCTCTGAAGA ACAGAACAGA TTTCCTTGTT CCCTTGCTGA GCTCTGTCTT 

0 \5o " L 36 2 o" " » ' ^0 ' 3650 3660 
AACTGTGGCT TGGATCTGTT GCTTGGTGAC GGCCTTCTAC TGGTGCCTGC <SA«GGCG 

T V 36 5„ " ' 3680 C ' 3690 * VoO " ' 3710 3720 
GAAGCCGGGC AGCCACACAC ACTCAGCCTC TGAGGACAAC ACCACCAACA ACGTGCGGGA 
IPC SHT H S A 5 EDN TTM N V K t 
1710 3740 3750 3760 3770 3780 

GCAGCTGAAC CAGATCAAM ACCCCATTGA GAAACATGGG GCCAACACGG JCCOATCAA 
ni m OTIC NPI E KHG ANT VKiN 
™o 3800 3810 3820 3830 3840 

ggattaEgag mcaagaact ccaaaatgtc taaaataagg acacacaatt ctgaagtaga 

° Uo " " 3860 S " 3870 " » 3890 3900 

AGAGGACSC ATGGACAAAC ACCAGCAGAA AGCCCGGTTT GCCAAGCAGC CGGCGTACAC 

cnn m n K HOQK ARF A K Q kati 

E 19?0 3920 3930 3940 3950 3960 
GCTGGTAGAC AGAGAAGAGA AGCCCCCCM CGGCACGCCG AWAAACACC CAAACTG6AC 

tun n c c KPPN GTP TkH k h w 
Z 3%0 3990 4000 4010 4020 

AAACAAACAG GACAACAGAG ACTTGGAAAG TGCCCAGAGC TTAAACCGAA TGGAGTACAT 

Mi^n n NI R DLES AQS LNK M t t 
4030 4040 4050 4060 4070 4080 

CGTATAGCAG ACCGCGGGCA CTGCCGCCGC TAGGTAGAGT CTGAGGGCTT GTAGTTCTTT 
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4090 4100 4110 4120 4130 4140 <^ 

AAACTGTCGT GTCATACTCG AGTCTGAGGC CGTTGCTGAC TTAGAATCCC TGTGTTAATT ^ 

4150 4160 4170 4180 4190 4200 q? 

TAGTTTGACA AGCTGGCTTA CACTGGCAAT GGTAGTTCTG TGGTTGGCTG GGAAATCGAG 

4210 4220 4230 4240 4250 4250 

TGGCGCATGT CACAGCTATG CAAAAAGCTA GTCAACAGTA CCCCTGGTTG TGTGTCCCCT 

4270 4280 4290 4300 4310 4320 

TGCAGCCGAC ACGGTCTCGG ATCAGGCTCC CAGGAGCTGC CCAGCCCCCT GGTACTTTGA 

4330 4340 4350 4360 4370 4380 

GCTCCCACTT CTGCCAGATG TCTAATGGTG ATGCAGTCTT AGATCATAGT TTTATTTATA 

4390 4400 4410 4420 4430 4440 

TTTATTGACT CTTGAGTTGT TTTTGTATAT TG6TTTTATG ATGACGTACA AGTAGTTCTG 

4450 4460 4470 4480 4490 4500 

TATTTGAAAG TGCCTTTGCA GCTCAGAACC ACAGCAACGA TCACAAATGA CTTTATTATT 

4510 4520 4530 4540 4550 4560 

TATTTTT7TT AATTGTATTT TTGTTGTTGG GGGAGGGGAG ACTTTGATGT CAGCAGTTGC 

4570 4580 4590 4600 4610 4620 

TGGTAAAATG AAGAATTTAA AGAAAAAATG TCCAAAAGTA GAACTTTGTA TAGTTATGTA 

4630 4640 4650 4660 4670 4680 

AATAATTCTT TTTTATTAAT CACTGTGTAT ATTTGATTTA TTAACTTAAT AATCAAGAGC 

4690 4700 4710 4720 4730 4740 

CTTAAAACAT CATTCCTTTT TATTTATATG TATGTGTTTA GAATTGAAGG TTTTTGATAG 

4750 4760 4770 4780 4790 4800 

CATTGTAAGC GTATGGCTTT AT7TTTTTGA ACTCTTCTCA TTACTTGTTG CCTATAAGCC 

4810 4820 4830 4840 4850 4860 

AAAAAGGAAA GGGTGTTTTG AAAATAGTTT ATTTTAAAAC AATAGGATGG GCTACACGTA 

4870 4880 4890 4900 4910 4920 

CATAGGTAAA TAATAGCACC GTACTGGTTA TGATGATGAA AATAACTGGA AACTTGAAAG 

4930 4940 4950 4960 4970 4980 
CTTGTGGTAA TGGCAGATAA AGATGG7TCA CCTGGGAAAT TAAAACTTGA ATGGTTGTAC 

4990 5000 5010 5020 5030 5040 

AGAAAAGCAC AGAGTGGAAT GCACATCAAT GACAGTAAGG GAGTTAGTTC TAGGAACAGC 

5050 5060 5070 5080 5090 5100 

TCCTGAACAG TAAGATTCCC GCAATAGTCT CCGCCTCGTT CGTCTATGGT ATGCATCCCA 

5110 5120 5130 5140 5150 5160 

TTCATTTTCT TCTTCTGATT ATTGTCATCT TTCCCTTTGC CAAATGGGCA GTTATTGTTT 

5170 5180 5190 5200 5210 5220 

CAGGGAGAGA AGCTGCTCAT TGGCCAATCA TTCTGGTGTG CAGTGCTCCA TCGGATTCTA 

5230 5240 5250 5260 5270 5280 

CATGTCCAAC AAGGCATGTC TGGATGATGC AATGTCTGTC TGACCCCCGG AATTCCGTGC 
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«nn 5310 5320 5330 5340 & 

**5K TTCTaS ATATAc|t TTTATTATTA ACAAACTTT6 GCCACAACCT 



.CATGTATA AATTGCCGGA — CCTTTCATTS TG6CTrT6Gft CA^ 

ctcacttItc Tscncg -tttctc ™ cctcagttct tacttattta 

TTTATTTTCA GTGG^A GG.CCTCTT ^ ~ «CTTTCTT 

iATA AAAACGGGCT GAAAAAGGAG 
i600 5610 5620 

VGAA GACAGCCTTT AA6CCTCTT6 
S660 5670 5680 

~GGG TCTGATCTTC CAGAGGAGTT 
S720 5730 5740 

DTGC TGCTGATGTT GCAGTTTTGA 
5780 5790 5800 

GAGT CTGTCACTGA CATTTTTTC/> 
5840 5850 586C 

JGTG TCAGTTGAGA GTTCACAATC 
5900 5910 592C 

TCTG TGTGAGTAAC CTGTAAAAG/ 
5960 5970 598( 

lAAAA ATCTTTATAA ACTTTCCTA/ 
6020 6030 6041 

TGTAC CCTGTCTCCA CCAAAAATAI 
6080 6090 610' 

bbib^MLM^ u«.rr.rCCCA GCAACTCTQG ww 

«J52 ™ *™g ™ «™ 

sc^ccr gtctcaS taaaaS taaataaata aattagccag «»» 

cacgcS ™; ™ T ™ «™ GCCTGAGAGG 



ATTGAAATGG TCACaS AAAAc|a —0 AGTTCCAGGA GAAAAGCCCA 
CTCCtS GACAGcgr AAGCCTCTTC CTTACTGAAG GA*CCCCAC 

CGGG TCTGATCTTC CAGAGGAGTT 
5720 5730 5740 

,CTGC TGCTGATGTT GCAGTTTTGA 
5780 5790 5800 

"GAGT CTGTCACTGA CATTTTTTCA 
5840 5850 5860 

GTG TCAGTTGAGA GTTCACAATC 
5900 5910 5920 

rTCTG TGTGAGTAAC CTGTAAAAGA 
5960 5970 5980 

bttb ilium . «nr, <AAAAA ATCTTTATAA 60 tjy ouou 

TATTTGGAGA GAAAaS CCTGtS CCAAAAATAC AAAAATTAGC 
GGTGCACACC GGTAaS CCAAcS ACACTA^GC — GCTTGACCCA 



CHCnS TGAGGCS TCTGATCTTC C^TT GGA^AGTCC ATGAGAATGG 

CCACCATTCT TGCTTGCTGC TGCTGATGTT GCAGTTTTGA GAGAACAGCG GGATCCTTGT 

TGAGT CTGTCACTGA CATTTTTTCA 
5840 5850 5860 

j\TGTG TCAGTTGAGA GTTCACAATC 

TCMTTTc!g « CTGTAAAAGA CACCTTTCCA GAAOAGTTTT 
CCCGTCTGTT TGAAaT ATCTTT g ACTTTCCTAA GTATCTGGAT TT.ATTCCT 



CCAtLAMLl Q 5 g 10 WU 

TGTCCTCTAG AGACtS CTGTCACTGA CATTTTTTCA GTTCCTTTGC TCATAWCCA 
™*3S TAGTGA S TCAGtX BHCACMTC TCATTGTTCA TTTAATTCAC 



«*SS AGTGAGTCAT GTTTgS CTGCACTCCA GC~ CAGCAAGA 

6370 
CTAA 

ttgggS gaggttSa taatccSS GCTCTGGGAA TTCC 
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GGCCGGGGCC GGGCGGGCGG GTCGCGGGGG CAATGCGGGC GCAGGGCCGG GGGCGCCTTC 60 ^ ^ ^ 

CCCGGCGGCT GCTGCTGCTG CTGGCGCTCT GGGTGCAGGC GGCGCGGCCC ATGGGCTATT 120 

TCGAGCTGCA GCTGAGCGCG CTGCGGAACG TGAACGGGGA GCTGCTGAGC GGCGCCTGCT 180 

GTGACGGCGA CGGCCGGACA ACGCGCGCGG GGGGCTGCGG CCACGACGAG TGCGACACCG 240 

CTCCTTTACC CTCATCGTGG AGGCCTGGGA CTGGGACAAC GATACCACCC CGAATGAGGA 300 

GCTGCTGATC GAGCGAGTGT CGCATGCCGG C ATG ATC AAC CCG GAG GAC CGC 352 

Met He Asn Pro Glu Asp Arg 
1 5 

TGG AAG AGC CTG CAC TTC AGC GGC CAC GTG GCG CAC CTG GAG CTG CAG 400 
Trp Lys Ser Leu His Phe Ser Gly His Val Ala His Leu Glu Leu Gin 

10 15 20 

ATC CGC GTG CGC TGC GAC GAG AAC TAC TAC AGC GCC ACT TGC AAC AAG 448 
He Arg Val Arg Cys Asp Glu Asn Tyr Tyr Ser Ala Thr Cys Asn Lys 

25 30 35 

TTC TGC CGG CCC CGC AAT GAC TTT TTC GGC CAC TAC ACC TGC GAC CAG 496 
Phe Cys Arg Pro Arg Asn Asp Phe Phe Gly His Tyr Thr Cys Asp Gin 

45 50 
TAC GGC AAC AAG GCC TGC ATG GAC GGC TGG ATG GGC AAG GAG TGC AAG 544 
Tyr Gly Asn Lys Ala Cys Met Asp Gly Trp Met Gly Lys Glu Cys Lys 

60 65 70 

GAA GCT GTG TGT AAA CAA GGG TGT AAT TTG CTC CAC GGG GGA TGC ACC 592 
Glu Ala Val Cys Lys Gin Gly Cys Asn Leu Leu His Gly Gly Cys Thr 

75 BO 85 N 

GTG CCT GGG GAG TGC AGG TGC AGC TAC GGC TGG CAA GGG AGG TTC TGC 640 
Val Pro Gly Glu Cys Arg Cys Ser Tyr Gly Trp Gin Gly Arg Phe Cys 

90 95 100 

GAT GAG TGT GTC CCC TAC CCC GGC TGC GTG CAT GGC AGT TGT GTG GAG 688 
Asp Glu Cys Val Pro Tyr Pro Gly Cys Val His Gly Ser Cys Val Glu 

105 HO n5 

CCC TGG CAG TGC AAC TGT GAG ACC AAC TGG GGC GGC CTG CTC TGT GAC 736 
Pro Trp Gin Cys Asn Cys Glu Thr Asn Trp Gly Gly Leu Leu Cys Asp 
i?n 125 130 I 35 

AAA GAC CTG AAC TAC TGT GGC AGC CAC CAC CCC TGC ACC AAC GGA GGC 784 
Lys Asp Leu Asn Tyr Cys Gly Ser His His Pro Cys Thr Asn Gly Gly 
140 145 150 
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ACG TGC ATC AAC GCC GAG CCT GAC CAG TAC CGC TGC ACC TGC CCT GAC 
Thr Cys He Asn Ala Glu Pro Asp Gin Tyr Arg Cys Thr Cys Pro Asp 

155 160 165 

GGC TAC TCG GGC AGG AAC TGT GAG AAG GCT GAG CAC GCC TGC ACC TCC 
Glv Tvr Ser Gly Arg Asn Cys Glu Lys Ala Glu His Ala Cys Thr Ser 

170 175 180 

AAC CCG TGT GCC AAC GGG GGC TCT TGC CAT GAG GTG CCG TCC GGC TTC 
Asn Pro Cys Ala Asn Gly Gly Ser Cys His Glu Val Pro Ser Gly Phe 

185 190 195 

GAA TGC CAC TGC CCA TCG GGC TGG AGC GGG CCC ACC TGT GCC CTT GAC 
Glu Cys His Cys Pro Ser Gly Trp Ser Gly Pro Thr Cys Ala Leu Asp 
?nn 205 210 215 

ATC GAT GAG TGT GCT TCG AAC CCG TGT GCG GCC GGT GGC ACC TGT GTG 
lie Asp Glu Cys Ala Ser Asn Pro Cys Ala Ala Gly Gly Thr Cys Val 

220 225 230 

GAC CAG GTG GAC GGC TTT GAG TGC ATC TGC CCC GAG CAG TGG GTG GGG 
Asp Gin Val Asp Gly Phe Glu Cys He Cys Pro Glu Gin Trp Val Gly 

235 240 245 

GCC ACC TGC CAG CTG GAC GCC AAT GAG TGT GAA GGG AAG CCA TGC CTT 
Ala Thr Cys Gin Leu Asp Ala Asn Glu Cys Glu Gly Lys Pro Cys Leu 

250 255 . 260 

AAC GCT TTT TCT TGC AAA AAC CTG ATT GGC GGC TAT TAC TGT GAT TGC 
Asn Ala Phe Ser Cys Lys Asn Leu He Gly Gly Tyr Tyr Cys Asp Cys 

265 270 275 

ATC CCG GGC TGG AAG GGC ATC AAC TGC CAT ATC AAC GTC AAC GAC TGT 
lie Pro Gly Trp Lys Gly He Asn Cys His He Asn Val Asn Asp Cys 
?m 285 290 29b 

CGC GGG CAG TGT CAG CAT GGG GGC ACC TGC AAG GAC CTG GTG AAC GGG 
Arg Gly Gin Cys Gin His Gly Gly Thr Cys Lys Asp Leu Val Asn Gly 

300 305 310 

TAC CAG TGT GTG TGC CCA CGG GGC TTC GGA GGC CGG CAT TGC GAG CTG 
Tyr Gin Cys Val Cys Pro Arg Gly Phe Gly Gly Arg His Cys Glu Leu 

315 320 325 

GAA CGA GAC AAG TGT GCC AGC AGC CCC TGC CAC AGC GGC GGC CTC TGC 
Glu Arg Asp Lys Cys Ala Ser Ser Pro Cys His Ser Gly Gly Leu Cys 

330 335 340 

GAG GAC CTG GCC GAC GGC TTC CAC TGC CAC TGC CCC CAG GGC TTC TCC 
Glu Asp Leu Ala Asp Gly Phe His Cys His Cys Pro Gin Gly Phe Ser 
345 350 355 



832 & 



880 
928 
976 
1024 
1072 
1120 
1168 
1216 
1264 
1312 
1360 
1408 
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GGG CCT CTC TGT GAG GTG GAT GTC GAC CTT TGT GAG CCA AGC 
Gly Pro Leu Cys Glu Val Asp Val Asp Leu Cys Glu Pro Ser 

360 3 65 370 

CGG AAC GGC GCT CGC TGC TAT AAC CTG GAG GGT GAC TAT TAC 
Ara Asn Gly Ala Arg Cys Tyr Asn Leu Glu Gly Asp Tyr Tyr 

380 385 
TGC CCT GAT GAC TTT GGT GGC AAG AAC TGC TCC GTG CCC CGC 
Cys Pro Asp Asp Phe Gly Gly Lys Asn Cys Ser Val Pro Arg 
3 395 400 405 

TGC CCT GGC GGG GCC TGC AGA GTG ATC GAT GGC TGC GGG TCA 
Cys Pro Gly Gly Ala Cys Arg Val He Asp Gly Cys Gly Ser 

410 415 420 

GGG CCT GGG ATG CCT GGC ACA GCA GCC TCC GGC GTG TGT GGC 
Gly Pro Gly Met Pro Gly Thr Ala Ala Ser Gly Val Cys Gly 

425 430 435 

GGA CGC TGC GTC AGC CAG CCA GGG GGC AAC TTT TCC TGC ATC 
Gly Arg Cys Val Ser Gin Pro Gly Gly Asn Phe Ser Cys He 
440 445 450 

AGT GGC TTT ACT GGC ACC TAC TGC CAT GAG AAC ATT GAC GAC 
Ser Gly Phe Thr Gly Thr Tyr Cys His Glu Asn He Asp Asp 

460 465 
GGC CAG CCC TGC CGC AAT GGG GGC ACA TGC ATC GAT GAG GTG 
Gly Gin Pro Cys Arg Asn Gly Gly Thr Cys He Asp Glu Val 
475 480 485 

TTC CGC TGC TTC TGC CCC AGC GGT TGG GAG GGC GAG CTC TGC 
Phe Arg Cys Phe Cys Pro Ser Gly TrD Glu Gly Glu Leu Cys 

490 495 500 

AAT CCC AAC GAC TGC CTT CCC GAT CCC TGC CAC AGC CGC GGC 
Asn Pro Asn Asp Cys Leu Pro Asp Pro Cys His Ser Arg Gly 

505 510 515 

TAC GAC CTG GTC AAT GAC TTC TAC TGT GCG TGC GAC GAC GGC 
Tyr Asp Leu Val Asn Asp Phe Tyr Cys Ala Cys Asp Asp Gly 
520 525 530 

GGC AAG ACC TGC CAC TCA CGC GAG TTC CAG TGC GAT GCC TAC 
Gly Lys Thr Cys His Ser Arg Glu Phe Gin Cys Asp Ala Tyr 

540 545 
AGC AAC GGT GGC ACC TGC TAC GAC AGC GGC GAC ACC TTC CGC 
Ser Asn Gly Gly Thr Cys Tyr Asp Ser Gly Asp Thr Phe Arg 
555 560 565 

TGC CCC CCC GGC TGG AAG GGC AGC ACC TGC GCC GTC GCC AAG 
Cys Pro Pro Gly Trp Lys Gly Ser Thr Cys Ala Val Ala Lys 
570 575 580 



CCC TGC 
Pro Cys 
375 
TGC GCC 
Cys Ala 
390 

GAG CCG 
Glu Pro 



GAC GCG 
Asp Ala 

CCC CAT 
Pro His 

TGT GAC 
Cys Asp 
455 
TGC CTG 
Cys Leu 
470 

GAC GCC 
Asp Ala 

GAC ACC 
Asp Thr 

CGC TGC 
Arg Cys 



TGG AAG 
Trp Lys 
535 
ACC TGC 
Thr Cys 
550 

TGC GCC 
Cys Ala 

AAC AGC 
Asn Ser 



1456 < 
1504 
1552 
1600 
1648 
1696 
1744 
1792 
1840 
1888 
1936 
1984 
2032 
2080 
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A6C TGC 
Ser Cys 
585 
GGG GCC 
Gly Ala 
600 

TGC ACT 
Cys Thr 

GGC ATC 
Gly He 

GGC TTC 
Gly Phe 

TCG CCC 
Ser Pro 
665 
CGC TGT 
Arg Cys 
680 

ATC GGG 
He Gly 



CTG CCC 
Leu Pro 

TCC TTC 
Ser Phe 

CAC AAT 
His Asn 

TGT GTT 
Cys Val 
635 
GCG GGG 
Ala Gly 
650 

TGT GCC 
Cys Ala 

AGC TGC 
Ser Cys 

TTC GGG 
Phe Gly 



AAC CCC TGT 
Asn Pro Cys 
590 

TCC TGC ATC 
Ser Cys He 

605 
ACC AAC GAC 
Thr Asn Asp 
620 

GAC GGC GTC 
Asp Gly Val 

CCT GAC TGC 
Pro Asp Cys 



GGA AGC 
Gly Ser 

CGC CGT 
Arg Arg 

GCC GGC 
Ala Gly 
745 
TGC CTG 
Cys Leu 
760 

TGG GGG 
Trp Gly 

CGC TCC 
Arg Ser 



TCC TGG 
Ser Trp 
715 
GAC TGC 
Asp Cys 
730 

CAG CCC 
Gin Pro 

GAG AAG 
Glu Lys 

GAG TGC 
Glu Cys 

GGC CAC 
Gly His 
795 



TAC GGG GCC 
Tyr Gly Ala 
670 

CCA CCC GGC 
Pro Pro Gly 

685 
AGA TCC TGC 
Arg Ser Cys 
700 

GTG GAA GAC 
Val Glu Asp 

AGC AAG GTG 
Ser Lys Val 

GAG GCC CTG 
Glu Ala Leu 
750 

GCC CCA GGC 
Ala Pro Gly 

765 
GGC GCA GAA 
Gly Ala Glu 
780 

CTG GAC AAT 
Leu Asp Asn 



GTG AAT GGT 
Val Asn Gly 

TGC CGG GAC 
Cys Arg Asp 

TGC AAC CCT 
Cys Asn Pro 
625 

AAC TGG TTC 
Asn Trp Phe 

640 
CGC ATC AAC 
Arg He Asn 
655 

ACG TGT GTG 
Thr Cys Val 



CGA GCC GGC 
Arg Ala Gly 



TGG TCC CGG 
Trp Ser Arg 
705 

TGC AAC AGC 
Cys Asn Ser 

720 
TGG TGC GGA 
Trp Cys Gly 
735 

AGC GCC CAG 
Ser Ala Gin 

CAG TGT CTG 
Gin Cys Leu 

GAG CCA CCG 
Glu Pro Pro 
785 

AAC TGT GCC 
Asn Cys Ala 
800 



GGC ACC 
Gly Thr 
595 
GGC TGG 
Gly Trp 
610 

CTG CCT 
Leu Pro 

CGC TGC 
Arg Cys 

ATC GAC 
lie Asp 

GAT GAG 
Asp Glu 
675 
CCC CGG 
Pro Arg 
690 

GGC ACT 
Gly Thr 



TGC GTG GGC AGC 
Cys Val Gly Ser 



GAG GGT 
Glu Gly 

TGC TAC 
Cys Tyr 

GAG TGT 
Glu Cys 
645 
GAG TGC 
Glu Cys 
660 

ATC AAC 
lie Asn 

TGC CAG 
Cys Gin 

CCG TTC 
Pro Phe 



TGC CGC 
Cys Arg 

TGG AAG 
Trp Lys 

TGC CCA 
Cys Pro 
755 
CGA CCA 
Arg Pro 
770 

AGC ACC 
Ser Thr 

CGC CTC 
Arg Leu 



TGC CTG 
Cys Leu 
725 
CCT TGT 
Pro Cys 
740 

CTG GGG 
Leu Gly 

CCC TGT 
Pro Cys 

CCC TGC 
Pro Cys 

ACC TTG 
Thr Leu 
805 



CGT ACT 
Arg Thr 
615 
AAT GGT 
Asn Gly 
630 

GCA CCT 
Ala Pro 

CAG TCC 
Gin Ser 

GGG TAT 
Gly Tyr 

GAA GTG 
Glu Val 
695 
CCA CAC 
Pro His 
710 

GAT GGC 
Asp Gly 



CTG CTG 
Leu Leu 

CAA AGG 
Gin Arg 

GAG GCC 
Glu Ala 
775 
CTG CCA 
Leu Pro 
790 

CAT TTC 
His Phe 



2128 
2176 
2224 
2272 
2320 
2368 
2416 
2464 
2512 
2560 
2608 
2656 
2704 
2752 
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AAC CGT GAC CAC GTG CCC CAG GGC ACC 
Asn Arg Asp His Val Pro Gin Gly Thr 

810 815 
GGG ATC CGC TCC CTG CCA GCC ACA AGG 
Gly He Arg Ser Leu Pro Ala Thr Arg 

825 830 
CTG GTG TTG CTT TGC GAC CGG GCG TCC 
Leu Val Leu Leu Cys Asp Arg Ala Ser 
840 845 
GTG GCC GTG TCC TTC AGC CCT GCC AGG 
Val Ala Val Ser Phe Ser Pro Ala Arg 
860 

ATC CAG GGC GCG GCC CAC GCC ATC GTG 
He Gin Gly Ala Ala His Ala He Val 
875 880 
AAC AGC TCA CTG CTC CTG GCT GTC ACC 
Asn Ser Ser Leu Leu Leu Ala Val Thr 

890 895 
GTT ACG GGC GGC TCT TCC ACA GGT CTG 
Val Thr Gly Gly Ser Ser Thr Gly Leu 

905 910 
GCC TTC AGC GTG CTG TGG CTG GCG TGC 
Ala Phe Ser Val Leu Trp Leu Ala Cys 
920 925 
ACA CGC AAG CGC AGG AAA GAG CGG GAG 
Thr Arg Lys Arg Arg Lys Glu Arg Glu 
940 

GAG AGC GCC AAC AAC CAG TGG GCC CCG 
Glu Ser Ala Asn Asn Gin Trp Ala Pro 
955 960 
ATT GAG CGG CCG GGG GGG CAC AAG GAC 
He Glu Arg Pro Gly Gly His Lys Asp 

970 975 
TTC ACT CCA CCG CCG CGC AGG CGC TGC 
Phe Thr Pro Pro Pro Arg Arg Arg Cys 

985 990 
CCG TCA GGG AGG ATG AGG AGG ACG AGG 
Pro Ser Gly Arg Met Arg Arg Thr Arg 
1000 io° 5 
ACT CCC TGG AGG CGG AGA AGT TCC TCT 
Thr Pro Trp Arg Arg Arg Ser Ser Ser 
1020 



ACG GTG 
Thr Val 



GCT GTG 
Ala Val 

TCG GGG 
Ser Gly 
850 
GAC CTG 
Asp Leu 
865 

GCC GCC 
Ala Ala 

GAG GTC 
Glu Val 

CTG GTG 
Leu Val 



GTG GTC 
Val Val 
930 
AGG AGC 
Arg Ser 
945 

CTC AAC 
Leu Asn 



GGC GCC 
Gly Ala 
820 
GCA CGG 
Ala Arg 
835 

GCC AGT 
Ala Ser 

CCT GAC 
Pro Asp 

ATC ACC 
He Thr 

AAG GTG 
Lys Val 
900 
CCT GTG 
Pro Val 
915 

CTG TGC 
Leu Cys 



ATT TGC TCC 
He Cys Ser 

GAC CGC CTG 
Asp Arg Leu 



2800 



2848 



0 

V 



GCT GTG 
Ala Val 

AGC AGC 
Ser Ser 
870 
CAG CGG 
Gin Arg 
885 

GAG ACG 
Glu Thr 



GAG 2896 

Glu 

855 

CTG 2944 
Leu 

GGG 2992 
Gly 

GTT 3040 
Val 



CTG TGT GGT 3088 
Leu Cys Gly 



CGG CTG 
Arg Leu 

CCC ATC 
Pro He 



GTG CTC TAC CAG 
Val Leu Tyr Gin 
980 

CCG GGC CGG CCG 
Pro Gly Arg Pro 
995 

ATC TTG GCC GCG 
He Leu Ala Ala 

1010 
CAC ACA AAT TCA 
His Thr Asn Ser 
1025 



GTG TGG 
Val Trp 

CCG CGG 
Pro Arg 
950 
CGC AAC 
Arg Asn 
965 

TGC AAG 
Cys Lys 



TGG 3136 

Trp 

935 

GAG 31B4 
Glu 

CCC 3232 
Pro 

AAC 3280 
Asn 



GCC ACG CGG 
Ala Thr Arg 



3328 



GTG AGG AGG 3376 
Val Arg Arg 
1015 

CCA AAG ATC 3424 
Pro Lys lie 
1030 
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2SSSSSSSSSSSSSS5 
5SSSSSSSSSSSSSSS 

Arg Glu Val Gly Arg Leu Gin Leu Gly Arg Asp fro. 

„ GCC^ATG CCG TCT GCC V<* « « * « « - « * 
Gly Ala Met Pro Sen Ala Gly Pro Gly Gly Arg uy ^ 

108° _ 155 aaa Arr Arr AAA AAC AAA AAC CAA ATG TTT 

TTC TTT ATT TTG TGT AAA AAA ACC ACC AAA AAC AAA ^ ^ 

Phe Phe He Leu Cys Lys Lys Thr Thr Lys** Lys ^ 

S5SS5SSS5555SKS 
S IS S £ S S £ S 5 £ 5 ™ 5,5 K 8 

U 2°rrr apt rrr TPT ATG AAA GGA GAG AGC AAA GGG TGT CTG 
KSfSS £ S SS S W 31u Ser^s G,y Cys Leu 
1145 rrr ttt fir AfC AGA GGT TGT GCA CTG TTT 

r GAA TCT TCC TTT Sf« «J £ GGG TTT CTC TGT GOT CCA GGC 
Thr Glu Ser Ser Phe Tyr Ser Ser Leu Gly^ne ^ 
CAA AGT GCC GGT T ACC CAT « TGT GTT ffl GTG GCC CAT « TGT 
Gin Ser Ala Gly Glu Thr His Gly Qysvai biy ^ 

S55SSSSS5SSSSSSS 
SSa£S5SS55Sg5SBB 

1225 1230 



3472 



% % 



3520 



3568 



3616 



3664 



3712 



3760 



3808 



3856 



3904 



3952 



4000 
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err rrr ACC CTG GTT ATT GAT GTG GCC CTG GCT GCC GGC ACG GCC CGT 4096 
^ y g t Uu W He Asp Val Ala Leu Alatfa G!y Thr Ala Ag. 

^C° TG T TG ACGCACCTGT 2 G3TTGTTAGT GGGGCCTGAG GTCATCGGCG TGGCCCAAGG 4154 

CCGGCAGGTC AACCTCGGGC TTGCTGGCCA GTCCACCCTG CCTGCCGTCT GTGCTTCCTC 4214 

CTGCCCAGAA CGCCCGCTCC AGCGATCTCT CCACTGTGCT TTCAGAAGTG CCCTTCCTGC 4274 

TGCGCAGTTC TCCCATCCTG GGACGGCGGC AGTATTGAAG CTCGTGACAA GTGCCTTCAC 4334 

ACAGACCCCT CGCMCTGTC CACGCGTGCC GTGGCACCAG GCGCTGCCCA CCTGCCGGCC 4394 

CCGGCCGCCC CTCCTCGTGA AAGTGCATTT TTGTAMTGT GTACATATTA AAGGAAGCAC 4454 
TCTGTATAAA AAAAAAAAAC CGGAATTCC 
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^CTAmGTCGTCAGGGATCTAKCC^ 

SgS 

cgtgccacmcggaggmgctgccta^ 

GCTGGACCAACTO 

======= 

TOTOGACTTACTGTG^ 
rATTTrTT^TGTCTGTQGTCCTW 

AATCCAACAAA^^TT WC^ffi WCCT ACTGTCATGAGMTATCMTGACTGTGAGAGCAACCCCTGT 
AAC^A^GTGAMCAAATATTMTGACTGCAG 

GAATGCCCAGCTGGACACGCTC 
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AGTGGhTGMTGCTGGCCTTCTMTCAGCAGCCTGTGMGACCAftATGCMTTCTGATTCTTATTACC 

MGATAATTGTGCCMCATCACCTTCACCTTTMTMGGAAATGATGGCACCAGGCCTTACCACGGAG 

CACATTTGCAGTGMTTGAG6MTCTGMTATCCTGAAGAATGTTTCTGCTGAATATTCCATCTATAT 

TACCTGTGAGCCnCACACnGGCAMTMTGAMTACATGnGCTATnCTGCTGAAGATATAGGAG 

MGATGAAMCCCMTCMGGAMTCACAGATAAGATTAnGACCTTGTCAGTAAGCGTGATGGAAAC 

MCAWCTMnGCTG(^TCGCAGMGTCAGAGTACAMGGCGACCAGTTMGMCAAMCAGATTT 

CnGGTGCCATTACTGAGCTWGTCnMCAGTAGCCTGGATCTGCTGTCTGGTMCTGTTTTC^^ 

GGTGCATTCAAMGCGCAGAAAGCAGAGCAGCCATACTCACACAGCATCTGATGACAACACCACCAAC 

MCGTAAGGGAGWGCTGMTWGAnAAAMCCCCATAGAGAAACACGGAGCAAATACTGTTCCAAT 

TAAAGACTATGAAMCAAAMCTCTAAMTCGCCAAAATAAGGACGCACAATTCAGAAGTGGAGGAAG 

ATGACATGGACAMCACCAGCAAMGGCCCGGTnGCCAAGCAGCCAGCGTACACTTTGGTAGACAGA 

GATGAAMGCCACCCAACAGCACACCCACAAMCACCCAMCTGGACAAATAAACAGGACAACAGAGA 

CTTGGAMGTGCACAAAGTTTAAATAGAATGGAGTACATTGTATAG 
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QVASAS6QFE LEILSVQNVN GVLQNGNCCD G7RNPGDKKC TRDECDTYFK. 50 

VCLKEYQ5RV TAGGPCSFGS KSTPVIGGNT FNLKYSRNNE KNRIVIPFSF 100 

AWPRSYTLLV EAWDYNDNST NPDRIIEKAS HSGMINPSRQ WQTLKHNTGA 150 

AHFEYQIRVT CAEHYYGFGC NKFCRPRDDF FTEHTCDQNG NKTCLEGWTG 200 

A A A A A A A AAA A A A A A A A A A *Q 5L DOMAIN* ** * ********* *** 

PECNKAICRQ GCSPKHGSCT VPGECRCQYG WQGQYCDKCI PHPGCVHGTC 250 

*** < EGF 1 x 

1EPWQCLCET NWGGQLCDKD LNYCGTHPPC LNGGTCSNTG PDKYQCSCPE 300 

EGF 2 x EGF 3 

GYSGQNCEIA EHACLSDPCH NGGSCLETST GFECVCAPGW AGPTCTDNID 350 

x EGF 4 

OCSPNPCGHG GTCQOLVDGF KCICPPQWTG KTCQLDANEC EGKPCVNANS 400 

>< EFG 5 x 

CRNLIGSYYC DCITGWSGHN COININDCRG QCQNGGSCRD LVNGYRCICS 450 

EFG 6- x -- EFG 7— 

PGYAGDHCEK DINECASNPC MNGGHCODEI NGFQCLCPAG FSGNLCQLDI 500 

-x— - EFG 8 — - 

DYCEPNPCQN GAQCFNLAMD YFCNCPEDYE GKNCSHLKDH CRHPCEVID 550 

-x - EFG 9- x 

SCTVAVASNS TPEGVRYISS NVCGPHGKCK SQAGGKFTCE CNKGFTGTYC 600 

EFG 10 - - 

HENINDCESN PCKNGGTCID GVNSYKC1CS DGWEGTYCET NINDCSKNPC 650 

x EFG 11 x 

HNGGTCROLV NDFFCECKNG WKGKTCHSRD SQCOEATCNN GGTCYDEGDT 700 

EFG 12 x 

FKCMCPAGWE GATCNIARNS SCLPNPCHNG GTCVVSGDSF TCVCKEGWEG 750 

EGF 13 x— EGF 14 

PTCTQNTNDC SPHPCYNSGT CVDGONWYRC ECAPGFAGPD CRININECQS 800 

x EGF 15 x- 

SPCAFGATCV DEINGYRCIC PPGRSGPGCQ EVTGRPCFTS IRVMPDGAKW 850 

EGF 16 > 

DDDCNTCQCL NGKVTCSKVW CGPRPCIIHA KGHNECPAGH ACVPVKEDHC 900 

<- CYSTEINE-RICH REGION 

FTHPCAAVGE CWPSNQQPVK TKCNSDSYYQ DNCANITFTF NKEMMAPGLT 950 

-> 

TEHICSELRN LNILKNVSAE YSIYITCEPS HLANNEIHVA ISAEDIGEDE 1000 
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NPIKEITDKI IDLVSKRDGN NTLIAAVAEV RVQRRPVKNK TDFLVPLLSS 1050 
VLTVAWICCL VTVFYWCIQK RRKQSSHTHT ASDDNTTNNV REQLNQIKNP 1100 
IEKHGANTVP IKDYENKNSK IAKIRTHN5E VEEDDMDKHQ QKARFAKQPA 1150 
YTLVORDEKP PNSTPTKHPN WTNKQDNRDL ESAQSLNRME YIV 1193 
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GAATTCGGCACGAGGI 1 1 II 1 1 1 1 1 H 1 1 h I CCCCTCTTTTCTTTCTTTTCCTnTGCC 



-+- 



.+.. + + +— - + 60 



ATCCGAAAGAGCTGTCAGCCGCCGCCGGGCTGCACCTMAGGCGTCGGTAGGGGGATAAC 
61 + + + + - + 12 ° 

AGTCAGAGACCCTCCTGAAAGCAGGAGACGGGACGGTACCCCTCCGGCTCTGCGGGGCGG 
121 - + + " + " + " + — + 180 



CT(3C6GCCCCTCCGTTOnCCCCCTCCCCGAGA6ACACTCnCCTnCCCCCCACGAAG 
181 — + + + + 



+— - + 240 



ACACAGGGGCAGGAACGCGAGCGCTGCCCCTCCGCCATGGGAGGCCGCTTCCTGCTGACG 
241 + — " " + + + + 



CTCGCCCTCCTCTCGGCGCTGCTGTGCCGCTGCCAGGTTGACGGCTCCGGGGTGTTCGAG 
301 - — + + " + + + 

CTGAAGCTGCAGGAGTnGTCMCAAGMGGGGCTGCTCAGCAACCGCAACTGCTGCCGG 
. 361 +~ + + + + " + 

GGGGGCGGCCCCGGAGGQaCCGGGCAGCAGCAGTGCGACTGCMGACCnCTTCCGCGTC 
421 + + - + + 



+ -+ 480 



TGCCTCAAGCACTACCAGGCCAGCGTCTCCCCCGAGCCGCCCTGCACCTACGGCAGCGCC 
481 ... + + + - - ♦ 540 

ATCACCCCCGTCCTCGGCGCCAACTCCTTCAGCGTCCCCGACGGCGCGGGCGGCGCC6AC 
541 — - + *" " + + -'" + 600 

CCCGCCnCAG(^CCCCATCCGCnCCCCmGGCTTCACCTGGCCCGGCACCTTCTCG 
601 - +— + + " - + — - + + 660 

CTCATCATCGAGGCTCTGCACACCGACTCCCCCGACGACCTCACCACAGAAAACCCCGAG 
661 — + -— + " + + + 720 

CGCCTCATCAGCCGCCTGGCCACCCAGAGGCACCTGGCGGTGGGCGAGGAGTGGTCCCAG 

721 ♦ + + + " + — + 780 

GACCTGCACAGCAGCGGCCGCACCGACCTCMGTACTCCTATCGCTTTGTGTGTGATGAG 
781 ....+ — -+ + + +-- — ► 840 
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CACTACTACGGGGAAGGCTGCTCTGTCTTCTGCCG6CCCC6T6AC6ACC6CTTCGGTCAC 
841 -+ + + + + + 900 

TTCACCrGTGGAGAGCGTGGCGAGAAGGTCTGCAACCCAGGCTGGAAGGGCCAGTACTGC 
901 + + + + +-- + 960 

ACTGAGCCGAmGCnGCCTGGGTGTGACGAGCAGCACGGCnCTGCGACAMCCTGGG 
961 + + + + + — -+ 1020 

GAATGCAAGTGCAGA6TGGGTTGGCAGGGGCGGTACTGT6ACGAGTGCATCCGATACCCA 
1021 + --+- + + + + 1080 

GGCTGCCTGCACGGTACCTGTCAGCAGCCATGGCAGTGCAACTGCCAGGMGGCTGGGGC 
1081 + + + — + +— + 1140 

GGCCTTnCTGCMCCAGGACCTGAACTACT 
1141 -+ +- + + + + 1200 

CGGTGTACGTGGTTGTGGCCAGTCCCCTCGATGTGAACAAGAACGGCTGGACCCATGTGT 
1201 -+ +- -+--- + + + 1260 

GGCTCCAGGTGCGAGATTGAMTCMCGMTGTGATGCCMCCCnGCAAGAATGGTGGA 
1261 + + + +— +— + 1320 

AGCTGCACG^TCTCGAGMCAGCTAnCCTGTACaGCCCCCCAGGCTTCTATGGTAAA 
1321 -+ + + + + + 1380 

MCTGTGAGCT6AGTGCAATGACTTGTGCTGATGGACCGTGCTTCAATGGAGGGCGATGC 
1381 + +- + + +-- + 1440 

ACTGACMCCCTGATGGTGGATAWGCTGCCGCTGCCWCTGGGTTATTCTGGGTTCAAC 
1441 +— + ----+ +— + + 1500 

TGTGAAMGAAMTCGAnACTGCAGTTCCAGCCCTTGTGCTAATGGAGCCCAGTGCGTT 
1501 + +-- + + + + 1560 

GACCTGGGGAACTCCTACATATGCCAGTGCCAGGCTGGCTTCACTGGCAGGCACTGTGAC 
1561 + + + + + + 1620 

GACMCGTGGACGATTGCGCCTCCTTCCCCTGCGTCAATGGAGGGACCTGTCAGGATGGG 
1621 -+ + + + --+ + 1680 
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GTCMCGACTACTCCTGCACCTGCCCCCCGG6ATACAACGGGAAGAACTGCAGCACGCCG 
+ .._+ --+ + + + 1740 

GTGAGCAGATGCGAGCACMCCCCT6CCACAATGGGGCCACCTGCCACGAGAGAAGCAAC 

+ + + + + + 1800 

CGCTACGTGTGCGAGTGCGCTCGGGGCTACGGCGGCCTCAACTGCCAGTTCCTGCTCCCC 

_ + + + ..-+ +-- + I860 

GAGCCACCTCAGGGGCCGGTCATCGnGACTTCACCGAGAAGTACACAGAGGGCCAGAAC 
+ __ + + + -+ + 1920 

AGCCAGTTTCCCTGGATCGCAGTGTGCGCCGGGAnAnCTGGTCCTCATGCTGCTGCTG 
+ + + + +-- + 

TACWGTCGGTGTACGTCATATCAGMGAGAAAGATGAGTGCATWTAGCAACTGAGGTG 

+ + + + + + 2460 



1861 
1921 
2401 



TAAMCAGACGTGACGTGGCAMGCnATCGATACCGTCATCAAGCTT 
2461 



+ + + — -+ 2508 
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10 20 30 40 50 60 

* * * 

TACGATGAAY AACCTGGCGA ACTGCCAGCG TCAGAAGGAC ATCTCAGTCA GCATCATCGG 

70 80 90 100 110 120 

* * * 

GGCYACGTCA GATCARGAAC ACCAACAAGA AGGCGGACTT YMCASCGGGG GACCASAGCG 

130 140 150 160 170 180 

* * * 

TCCGACAAGA ATGGHTTTCA AGGCCYGCTA CCCCAGCGTG GACTATAACT CGTGCAGGAC 

190 200 210 220 230 240 

* * * 

CTCAAGGGTG ACGACACCGC CGTCAGGACG TCGCACAGCA AGCGTGACAC CAAGTGCCAG 

250 260 270 280 290 300 

* * * 



TCCCCAGGCT CCTCAGGGAG GAGAAGGGGA CCCCGACCAC ACTCAGGGGK TGCGTGCTGC 



310 320 330 340 350 360 

* * * 

GGGCCGGGCT CAGGAGGGGG TACCTGGGGG GTGTCTTCCT GGAACCACTG CTCCGTTTCT 
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370 380 390 400 410 420 

" * * * 

CTTCCCAAAT GTTCTCATGC ATTCATT6TG 6ATTTTCTCT ATTTTCCTTT TAGTCGASAA 



430 440 450 460 470 480 

* * * 

GCATCTGAAA GAAAAAGGCC GGACTCGGGC TGTTCAACTT CAAAAGACAC CAAGTACCAG 



490 500 510 520 

* * 

TCGGTGTACG TCATATCCGA GGAGAAGGAC GAGTGCGTCA TCGCA 
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10 20 30 . 40 50 60 

************ - 

CATTGGGTAC GGGCCCCCCT CGAGGTCGAC GGTATCGATA AGCTTGATAT CGAATTCCGG 
70 80 90 100 110 120 

** ** * * ** ** ** 

CTTGACCTGG CCGGGCACCT TCTCTCTGAT TATTGAAGCT CTCCACACAG ATTCTCCTGA 
130 140 150 160 170 180 

** ** ** ** ** ** 

TGACCTCGCA ACAGAAAACC CAGAAAGACT CATCAGCCGC CTGGCCACCC AGAGGCACCT 
190 200 210 220 230 240 

** ** **.** ** ** 

GACGGTGGGC GAGGAGTGGT CCCAGGACCT GCACAGCAGC GGCCGCACGG ACCTCAAGTA 
250 260 270 280 290 300 

** ** ** ** ** ** 

CTCCTACCGC TTCGTGTGTC ACCAACACTA CTACGGAGAG GGCTGCTCCG TTTTCTGCCG 
310 320 330 340 350 360 

** ** * * ** ** * * 

TCCCCGGGAC GATGCCTTCG GCCACTTCAC CTGTGGGGAG CGTGGGGAGA AAGTGTGCAA 
370 380 390 400 410 420 

* * ** ** ** ** ** 
CCCTGGCTCG AAAGGGCCCT ACTGCACAGA GCCGATCTGC CTGCCTGGAT GT6ATGAGCA 

430 440 450 460 470 480 

** ** ** * * ** ** 

GCATGGATTT TGTGACAAAC CAGGGGAATG CAAGTGCAGA GTGGGCTGGC AGGGCCGGTA 
490 500 510 520 530 540 

* * ** ** ** ** ** 

GTGTGACGAG TGTATCCGCT ATCCAGGCTG TCTCCATGGC ACCTGCCAGC AGCCCTGGCA 
550 560 570 580 590 600 

** ** ** ** ** ** 

GTGCAACTGC CAGGAAGGNT GGGGGGGCCT TTTCTGCAAC CAGGACCTGA ACTACTGCAC 
610 620 630 640 650 660 

** ** ** * * ** ** 
ACACCATAAG CCCTGCAAGA ATGGAGCCAC CTGCAACAAA CACGGGCCAG GGGGAGCTAC 
670 680 690 700 710 720 

** ** ** ** ** ** 

ACTTGGTCTT TGGCCGGNCT GGGGTACANA GGGTGGCACC TGC6AAGCTT GGGGATTGGA 
730 740 750 760 770 780 

* * ** ** ** ** ** 
CGAGTTGTTG ACCCCAGCCC TTGGTAAGAA CGGAGGGAGC TTGACGGATC TTCGGAGAAC 

790 800 810 820 830 840 

* * ** ** ** ** ** 
AGCTACTCCT GTACCTGCCC ACCCGGCTTC TACGGCAAM TCTGTGAATT GAGTGCCATG 

850 860 870 880 890 900 

** ** ** ** ** ** 
ACCTGTGCGG ACGGCCCTTG CTTTAACGGG GGTCGGTGCT CAGACAGCCC CGATGGAGGG 
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DECEIVED 

M 09 ?003 



910 920 930 940 

* * * * ** * * 

TACAGCT6CC GCTGCCCCGT GGGCTACTCC GGCTTCAACT 
970 980 990 1000 

* * ** * * * * 

TGCAGCTCTT CACCCTGTTC TAATGGTGCC AAGTGTGTGG 
1030 1040 1050 1060 

* ★ * * * * * * 

TGCCGCTGCC AGGCCGGCTT CTCGGGGAGG CACTGTGACG 
1090 1100 1110 1120 

* * * * ** ** 

TCCTCCCCGT GCGCCAACGG ACCTCGGTGA CGGGATGGCG 
1150 1160 1170 1180 

* * ** ** ** 

TGCCCGCCTG GCTACACGGG CAGGAACTGC AGTGCCCCCG 
1210 1220 1230 1240 

* * ** ** ** 

CCCTGCCACA ATGGGGCCAC CTGCCACGAG AGGGGCCACC 
1270 1280 1290 1300 

* * ** ** ** 

CGAAGCTACG GGGGTCCCAA CTCCCANTTC CTGCTCCCCC 
1330 1340 1350 1360 

* * ** ** ** 

CGGTGGTGGA AACTCCCCTA AAAAAACCTA AAAGGGCCGG 
1390 1400 1410 1420 

* * * * * * ** 

GGACGTGTGC GCCGGGGTCA TCCTTGTCCT CATGCTGCTG 
1450 1460 1470 1480 

* * * * * * ** 

GGTCTGCGTC CGGCTGAGGC TGCAGAAGCA CCGGCCCCCA 
1510 1520 1530 1540 

* * * * * * ** 

GACGGAGACC ATGAACAACC TGGNCAACTG CCAGCGTGAG 
1570 1580 1590 1600 

* * * * * * * * 

CATCGGGGNC ACGCAGATCA AGAACACCAA CAAGAAGGCG 
1630 1640 1650 1660 

* * ** ** ** 

NGCCGACAAG AATGGCTTCA AGGCCCGCTA CCCAGNGGTG 
1690 1700 1710 1720 

** ** ** ** 

CCTCAAGGGT GACGACACCG CCGTCAGCCA CGCGCACAGC 
1750 1760 1770 1780 

** ** ** ** 

GCCCCAG6GC TCCTCAGGGG AGGAGAAGGG GACCCCCGAC 



950 

* * 

GTGAGAAGAA 

1010 

* * 

ACCTCGGTGA 

1070 

* * 

ACAACGTGGA 

1130 

* * 

TGAACGACTT 

1190 

* * 

CCAGCACCTG 

1250 

* * 

GCTATNTGTG 

1310 

* * 

AAACTGCCCC 
1370 

* * 

GGGGGGCCCA 

1430 

* * 

CTGGGCTGTG 

1490 

* * 

GCCGACCCa 
1550 

* * 

AAGGACATCT 

1610 

* * 

GACTTCCACG 

1670 

* * 

GACTATAACC 

1730 

* * 

AAGCGTGACA 

1790 

* * 

CCACACTCAG 



960 

* * 

AATTGACTAC 

1020 

* * 

TGCCTACCTG 

1080 

* * 

CGACTGCGCC 

1140 

* * 

CTCCTGCACC 

1200 

* * 

CGAGCACGCA 

1260 

* * 

CGAGCACGCA 

1320 

* * 

CCCGGCGCCA 

1380 

* * 

TCCCCTTGGT 

1440 

* * 

CCGCTGTGGT 

1500 

* * 

GNCGGGGGGA 

1560 

* * 

CAGTCAGCAT 

1620 

* * 

GGGACCACAG 

1680 

* * 

TCGTGCAGGA 

1740 

* * 

CCAAGTGNCA 

1800 

* * 

GGGGTG6AGG 
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<§5 





1810 1820 1830 1840 1850 1860 

**• ** ** * * ** * * 
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